III. AMENDMENTS TO THE CLAIMS 

The following listing of claims replaces all prior versions, and listings, of claims in the 
application: 

1 . (Previously Presented) A computer implemented method of producing a linguistic dictionary, 
the method comprising: 

obtaining orthographic variations of dictionary words; 

explicitly storing substantially all orthographic variations of words in a finite state 
transducer database, and 

generating, for each of the orthographic variations, a cut and paste code, which indicates 
how many characters should be cut from the end of a surface form of a word and pasted to 
produce a particular variation, extended by a gloss code that indicates whether at least part of the 
orthographic variation should be converted between upper and lower case; 

storing, for each of the orthographic variations, the associated extended cut and paste 

code. 

2. (Original) The method of claim 1, wherein the gloss code further indicates whether conversion 
should be performed between each single and double character sequence in the orthographic 
variation. 

3. (Original) The method of claim 1, wherein the gloss code indicates one of (i)-(vii): 

(i) Do nothing; 
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(ii) Convert first character to upper case; 

(iii) Convert first character to lower case; 

(iv) Convert word to lower case; 

(v) Convert word to upper case; 

(vi) Convert word to upper case and replace each single character sequence with 
equivalent double character sequence; and 

(vii) Convert word to lower case and replace each double character sequence with single 
characters. 

4. (Original) The method of claim 1, further comprising: 

storing, for each word having an accented character: 

a word having a composite form of the accented character; and 

a word having an expanded form of the accented character that includes a base character 
and an accent character. 

5. (Currently Amended) A computer implemented linguistic dictionary comprising: 

a computer, having: 

a finite state transducer database for explicitly storing substantially all orthographic 
variations of words H ; and 

a means of storing wherein the database further stores, for each of the generated 
orthographic variations comprising: [,] a cut and paste code, which indicates how many 
characters should be cut from the end of a surface form of a word and pasted to produce a 
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particular variation, extended by a gloss code that indicates whether at least part of the 
orthographic variation should be converted between upper and lower case. 

6. (Original) The linguistic dictionary of claim 5, wherein the extended gloss code further 
indicates whether conversion should be performed between each single and double character 
sequence in the orthographic variation. 

7. (Original) The linguistic dictionary of claim 5, wherein the extended gloss code indicates one 
of(i)-(vii): 

(i) Do nothing; 

(ii) Convert first character to upper case; 

(iii) Convert first character to lower case; 

(iv) Convert word to lower case; 

(v) Convert word to upper case; 

(vi) Convert word to upper case and replace each single character sequence with 
equivalent double character sequence; and 

(vii) Convert word to lower case and replace each double character sequence with single 
characters. 
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8. (Original) The linguistic dictionary of claim 5, wherein the database stores, for each word 
having an accented character: 

a word having a composite form of the accented character; and 

a word having an expanded form of the accented character that includes a base character 
and an accent character. 

9. (Currently Amended) A computer program product encoded on a computer readable storage 
medium comprising computer program means executable by a computer for performing 
substantially the steps of: 

obtaining orthographic variations of dictionary words; 

explicitly storing substantially all orthographic variations of words in a finite state 
transducer database, and 

generating, for each of the orthographic variations, a cut and paste code, which indicates 
how many characters should be cut from the end of a surface form of a word and pasted to 
produce a particular variation, extended by a gloss code that indicates whether at least part of the 
orthographic variation should be converted between upper and lower case; 

storing, for each of the orthographic variations, the associated extended cut and paste 

code. 

10. (Original) The computer program product of claim 9, wherein the extended gloss code further 
indicates whether conversion should be performed between each single and double character 
sequence in the orthographic variation. 
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1 1 . (Original) The computer program product of claim 9, wherein the extended gloss code 
indicates one of (i)-(vii): 

(i) Do nothing; 

(ii) Convert first character to upper case; 

(iii) Convert first character to lower case; 

(iv) Convert word to lower case; 

(v) Convert word to upper case; 

(vi) Convert word to upper case and replace each single character sequence with 
equivalent double character sequence; and 

(vii) Convert word to lower case and replace each double character sequence with single 
characters. 

12. (Original) The computer program product of claim 9, further comprising computer program 
means for storing, for each word having an accented character: 

a word having a composite form of the accented character; and 

a word having an expanded form of the accented character that includes a base character 
and an accent character. 

13. (Previously Presented) The method of claim 1, wherein the extended code is adapted to allow 
a single orthographic variation to indicate a variation for a plurality of root words. 
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